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ABSTRACT 



Decision makers are often confronted with problems for which there 
exist several distinct measures of success. Such problems can often 
be expressed in terms of linear or nonlinear programming models with 
several "criterion" functions instead of single objective functions. 

A variety of techniques have been applied to multicriterion problems, 
but the approach used here, "The Dynamic Programming Approach to Multi- 
criterion Optimization Problem," is based on the concept that the ideal 
solution to a multiobjective problem must be a pareto optimal solution. 
In many cases simply narrowing the set of candidate solutions to the 
set of all pareto optimal solutions may enable the decision maker to 
find the compromise being sought. The determination of nondominated 
points and corresponding nondominated values (pareto optimal solution) 
related to the multicriterion optimization problem is approached through 
the use of dynamic programming. The dynamic programming approach has an 
attractive property which provides the basis for generation of non- 
dominated solutions at each stage by the decomposition method. By 
using recursive equations we can find out the nondominated points and 
corresponding nondominated solutions of multiaggregate return function. 
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I . INTRODUCTION 



A. PROLOGUE 

Situations requiring a decision typically involve multiple conflict- 
ing goals which cannot be satisfied simultaneously due either to the 
inherent conflict among some goals (e.g., maximizing comforts in 
traveling between two points vs. minimizing cost of travel. Maximizing 
effectiveness vs. minimizing cost) or to a scarcity of resource (e.g., 
maximizing productions on each of several individual production lines) . 

Since first described by Pareto many years ago, the problem of 
multicriteria, or vector valued optimization has fascinated economists, 
planners and engineers. Although it has also been considered in a very 
abstract framework by a number of researchers the most commonly en- 
countered form of the problem seems to be as follows: 

Maximize [f^x), f (x) , . . . , f L (x)] 
subject to g^(x) 0, i-l,2,...,m 

where x is a vector. 

During the past 10 years increased research in this area has been 
directed in several directions reflecting different classes of multiple 
objective decision problems. An excellent recent summary of this area 
is by K. R. MacCrimmon, [14]. Roy [15] summarized several approaches in 
1971 and Johnsen [16] offers an almost encyclopedic coverage of the 
field to 1975. 
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A lot of researchers solved the MCO problem by using the concept of 
a nondominated solution, Yu [l7] introduced a general concept of domi- 
nation structures; the idea helps to view the one-dimensional comparison 
and the set of nondominated (pareto optimal) solutions as two special 
cases of a more general solution concept. Zeleny [l8] develops a multi- 
criteria simplex method for generating all nondominated extreme points. 
Polak [l9] presents an algorithm which constructs an economical grid of 
noninferior points to be used in computation with an interpolation 
scheme in the value space. The algorithm is specifically designed for 
the bicriteria case, but it can also be used in higher dimensional 
situations . 

The algorithm we shall present in this thesis is specialized to the 
two or more criteria space with only a few equality or inequality type 
constraints (only one constraint in our example) . 

The theory of the dynamic programming approach to the multiple cri- 
teria optimization problem [MCODP ] is developed by combining the non- 
dominated solution concepts and dynamic programming method. 

Using the dynamic programming decomposition theorem, we break the 
multicriteria problem into N stages. At each stage we find the non- 
dominated solution by a recursion equation and hence solve the MCO 
problem by finding the nondominated solutions of multi -aggregate return 
function. 
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B. MULTI CRITERION OPTIMIZATION PROBLEM ANALYSIS - 
A REVIEW OF THE LITERATURE 

Maximize F(x) 

subject to g^(x) < 0 for i=l,2,...,m 

where x is an N-vector (x^, x 2 ,..., 

an L-vector with scalar components (f_ (x) , f (x) , . . . , f (x) ) . The 

12 L 

solution to this problem is the set of all x which are nondominated and 
which satisfy the constraints g^(x) < 0 for i=l,2,...,m. 

A solution vector x is nondominated if it satisfied the constraints 
and if there does not exist another x' also satisfying them and having 
the properties that 

f^(x') < f^(x) for i=l,2,...,L 
f^(x') > f^(x) for at least one such i. 

Although interest in multicriterion problems is quite recent, the liter- 
ature concerning these problems is already voluminous . 

For a bibliography and synthesis of work in this area see 
MacCrimmon [ 2 ] . In his paper, written in 1973, he discussed every 
method which was developed up to then and attempted to group them into 
four main categories to facilitate comparisons among the various methods 
They are 

a) weighting methods 

b) sequential elimination methods 

c) mathematical programming methods 

d) spatial proximity methods. 



x^) , and F(x) , a function of x, is 
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The class of weighting methods has received the most attention and 



particular models within this class have been the most widely applied. 
Although these weighting methods seem very diverse, they all have the 
following characteristics: 

- A process comparing attributes by obtaining numerical scalings of 
attribute values (intra-attribute preference) and numerical 
weights across attributes (inter-attribute preference) . 

MacCrimmon divided the weighting method into nine weighting methods. 
Some of them are: 

linear regression 
trade-offs 

simple additive weighting. 

The linear regression method can be used when the problem is in such 
situations that are sufficiently repetitive to group together. Attri- 
butes considered by the decision maker can be treated as variables in 
a linear model. From his past choice, coefficients of the attributes 
are estimated using standard regression techniques. 

Slovic and S. Lichtenstein [3] provide a very comprehensive survey 
of linear regression models and have an excellent bibliography. 

The trade-off methods differ from the linear-regression method in 
that the preference of the decision maker is obtained by directly asking 
him his preferences rather than by inferring them from his past choices. 
Although this method has the advantage of obviating the need for a con- 
siderable past history of similar situations, it has the disadvantage 
of possibly finding that the decision maker is unable to verbalize his 
true preferences. 
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In the trade-off approach, the marginal rates at which the decision 
maker is willing to trade one attribute for another are obtained by 
direct questioning. If all the relevant trade-offs can be made, then 
the focus of the problem can be narrowed to one attribute. Raiffa [5] 
utilizes a more general trade-off method on the very complex problem of 
obtaining certainty equivalents for lotteries in multiple attribute 
decision problems . 

In the simple additive weighting method, for each of the attributes 
decision problems. 

In the simple additive weighting method, for each of the attributes 
the decision maker assigns importance weights which become the coef- 
ficients of the variables. To reflect his marginal worth assessments 
within attributes, the decision maker also makes a numerical scaling of 
intra-attribute values. He can then obtain a total score for each 
alternative simply by multiplying the scale rating for each attribute 
value by the importance weight assigned to the attribute and then summing 
these products over all attributes . 

After the total scores are computed for each alternative, the alter- 
native with the highest score is the one prescribed to the decision 
maker. Simple additive weighting is very widely used. One important 
use of this technique was in the mustering out system of the U. S. Army 
at the end of World War II [6]. 

Sequential elimination methods are less demanding of the decision 
maker than weighting methods. Sequential elimination methods are 
characterized by a process for sequentially comparing alternatives on 
the basis of attribute values so that alternatives can be either elimi- 
nated or retained. 
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MacCrimmon distinguished among four methods. One of these is the 
dominance method which compares one real alternative against another 
real one to see if proper alternatives can be eliminated. If one 
alternative has attribute values that are at least as good as those 
of another alternative for all attributes , and if it has one or more 
values that are better, then the first alternative is said to ,, dominate ,, 
the second, and the second alternative can be eliminated. Although this 
is probably the least controversial decision rule, it unfortunately does 
not often succeed in eliminating very many alternatives. 

Terry [7] shows how dominance can be used as an initial filter in 
choosing among new product areas for comparing diversification. Freiner 
and Yu [8] combine dominance with the mathematical programming approaches. 

The class of programming methods has recently begun to receive much 
attention. MacCrimmon divided the mathematical programming methods into 
three methods, linear programming, goal programming and interactive, 
multi-criterion programming. 

A regular linear programming model may be viewed as a multiple 
attribute decision method. The variables are the attributes. The 
linear constraints are conjunctive constraints on combinations of 
attributes, and there is a linear, compensatory objective function. 

Efficient solution algorithms are available. The classic diet 
problem [9] is an example of this method. The objective is to provide 
a balanced diet, through the selection of the amounts of particular 
foods, that will satisfy particular nutritional standards at minimum 
cost. 

In a goal programming formulation, the decision maker specifies 
acceptable or desired levels on single attribute values (i.e., one- 
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variable constraints) or on combinations of attributes (i.e., multi- 
variable constraints) and these serve as the primary goals. 

Lee and Clay tor [10 ] apply goal programming to the scheduling of an 
academic department. The attributes (variables) are defined in terms 
of faculty and non-faculty personnel having varying degree qualifica- 
tions. The goals involve faculty/student ratios, faculty/staff ratios, 
percentage increase in salaries, low cost, etc. Solutions obtained 
under different assumptions about available budget and accreditation 
requirements are presented. 

Ijiri [11] develops the accounting aspects of goal programming. 

Interactive, multi-criterion programming does not assume a global 
objective but rather requires the decision maker to provide his local 
trade-offs in the neighborhood of a feasible alternative. These trade- 
offs (on the attributes or criterion involving attributes) are used in 
a local objective function for a mathematical programming algorithm to 
generate an optimal solution for that objective. The decision maker 
then has an opportunity to provide new trade-offs which again serve as 
inputs to the algorithm. This process continues until the decision 
maker no longer wishes to revise his trade-offs and so an optimal 
solution is reached. 

Geoffrion, Dyer, and Feinberg [12] describe the use of the method 
in scheduling an academic department. Related approaches are given in 
the studies of Benayoun de Montgolfier and Tergny [13] and Yu, Zeleny. 

While the preceding three classes are probably the most common 
general approaches to multi-attribute decision problems, some of the 
more specialized methods are also receiving attention. Since they all 
make explicit use of spatial representation, MacCrimmon calls them 



12 



M spatial proximity methods . 11 He distinguished among three methods in 
spatial proximity methods. One of them is the indfiference map method. 

The decision maker's preferences can be obtained in the form of 
indifference surfaces which show the combinations of attribute values 
that are equally preferred. The alternatives to be considered can be 
located in the same spatial representation, and by identifying the 
indifference surface on which they lie, a complete ordering among the 
alternatives can be generated. 

MacCrimmon [14] gives an example of the use of this procedure for 
transportation system planning. Many combined approaches are possible 
and hopefully this overview will provide some insight into the available 
methods so that readers can begin to consider useful combinations for 
their own multiple objective decision problems. 
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II. BASIC THEORY OF DYNAMIC PROGRAMMING APPROACH 
TO MULTICRITERION OPTIMIZATION PROBLEM 



A. INTRODUCTION 

This chapter is concerned with finding solutions to the following 
general problem, which will be called "MCO (Multicriterion Optimization) " 
problem. 

Find the set of nondominated solutions to 

maximize f^(x) 

f 2 ( x) 

f T (x) 

L 

subject to 

Ax < b 
x > 0 

where A is an M by N matrix 

b is a vector (b, ,b^ , . . . ,b ) 

12 N 

x is a vector (x^ f x^ , . . . ,x N > 

f^(x) is a decomposable function 

The dynamic programming approach to the multiple objective function 
problem is of interest because we can decompose each objective function 
into N-stages in dynamic programming (if separability and monotonicity 
conditions hold) . At each stage, we get the L single-variable return 
functions and we can find the nondominated policy by just eliminating 
the dominated policy. 
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Once we get the optimal returns (nondominated solution) from each 
stage , then we can easily obtain a nondominated solution for a system 
by Bellman's "principle of optimality" stated more succinctly in his 
words - 

An optimal policy has the property that whatever 
the initial state and decision are, the remaining 
decisions must constitute an optimal policy with 
regard to the state resulting from the first 
decision [20]. 

B . NOTATION 

The following notation and basic definitions will be employed in the 
remainder of this chapter. 

1. Multistage Problem Solving 

When solving a complex problem, we often break it into a series 
of smaller problems - decomposition - and then combine the results from 
the solution of the whole problem - composition. We call this approach 
multistage problem solving. 
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[Figure l] 
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2. State (x) 



The state of each stage is defined by a set of numbers called 
state variables 



x = (x 1# x 2 , , x N ) 

We use a subscript n to denote the value of the state variables after 
stage n - see (Fig. 1) . 

3. Decision Variable (D) 

The decision variables are those quantities that can be con- 
trolled or chosen in the design and operation of any stage 



D = (D 



]_' u 2' * * * ' 



D 



n 



V 



See (Fig. 1) . 



4 . Stage Transformation 

Each stage of the process transforms the state of its input into 
an output state in a way dependent on the decision that has been made 
for the operation of the stage. We express this symbolically by writing 



n-1 



= t (X , 
n n 



D ) 
n 



meaning that given X and D it is possible to calculate X _ . 

n n n-l 

5 . Multiple Return Function 



Each stage has L return functions. We call this a "multiple 
return function" and the notation is as follows 
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1 



where 




n = 1,2,..., N (stages) 

l = 1,2,..., L (number of return functions) 



6 . Multiple Aggregate Return Function 



th 

The total return R of the i return from stage one through N 
th 

is some function of the Z individual stage returns written as 



R = 



R 

L_ J 



R = 



Zr Z l 

9 ( r N • r N-l • 



•/ r x ] 



£ = 1,2,..., L (number of return function) 

7 . Inequality Notation : 

If the objective functions of an optimization problem is vector 
valued, such that 



Q = [Q\ Q 2 qV, 



we define a partial ordering of elements Q as follows: 
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Q < Q Q” < Q " 



— o — * 01 x i 

Q<Q +-> Q < Q » 



* i 



^ ^ j 9-Q° j ^ Q j 



O * 



Ol 



*1 



Q < Q ^ Q < Q 



— n — * 



Ol 



Q = Q ++ Q = Q 



v-. 

1 



8. Notation of Nondominated Points 



Given a set of feasible points 



VCR 



n 



and a vector criterion function 



f : R n -* R P 



then the set of nondominated points is 

SI = {x*e V | f(x^) > f(x*) = > x 1 



V}, 



9. Notation of Nondominated Values 



SI = {Z = R I 2 = f(x) , xeSl} 



The set of all nondominated solutions in the criteria space T(X ) will 

N N 



be denoted by 



S I T » < V 1 



V“' D 1 



where yy - (g<r N <y D„) , * D n V 
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that is, T(X . ) is the set of all feasible values of the vector 
N N 

9 ( VW' Vi'Vi'Vi 1 '— • r i (x i' D i )) fora11 V“ 'V 



subject to 



X = t (X , D ) , n = 1,2, N 

n-1 n n n 



and 9( WV' Vl'Vr'N-l 1 '"-' r 1 (X 1 ,D 1 )) is ^ 
the multiple aggregate return function. 

The optimal return (nondominated solution) from N stages is defined as 



VV - n I T »'V1 
V" " D 1 



subject to 



X = t (X ,D ) , n = 1,2, N. 

n-1 n n n 



10 . Multiple Recursion Equation 



Assuming that the optimal return, 



f n (X n ) = ft [ W ] is decomposable , 
D , — — — , D_ 



N' 



then 



where 



VV m « (VV1 

D N 



S N ( V * tg l ( VW ' F »-1 (X »-1 ), ' ¥ V 
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that is, ^(X^ is the set of all feasible values of the vector 



g l (r N (X N'°N ) ' 



n 

Vi'™ ' °1 



t T N-l (X N-l } ^ f ° r a11 values of D n - 



This notation means that each r(X ,D) is combined with each vector in 

N N N 

the set « tT N _ 1 (X N _ 1 )] 

V™' °1 



subject to 



Vi * VVV- 

and let 



VW ■ WW’ Vl'Vl” 



then 



s n ( V - { WV' * d n>- 

Applying the same decomposition to (X^_^) , , F^CX^)/ we obtain the 

multiple recursion equations. 



where 



F (X ) = fl [S (X ) ] n = 1, ,N 

n n n n 

D 

n 



s (s ) = {Q (X ,D ) , V D } 
n n n n n n 



Q (X ) = r (X ,D ) n = 1 
n n n n n 



= r (X ,D ) o F .(X . ,D .) n = 2, ,N 

n n n n-1 n-1 n-1 



and "o" is the composition operator 
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subject to 



X = t (X ,D ) . 
n-1 n n n 



C. BASIC CONCEPTS OF "NON DOMINATED" 



Let us briefly outline the basic concept. We will utilize the 

decision variable D as in Figure 2. The value of decision variable D 

n n 

results in the value of ^(X^D^ through the equation. 



n 



Q (X ,D ) = r (X ,D ) 
*n n n n n n 



oF (t (X ,D ) ) 
n-1 n n n 



* <W 



► Q (X ,D ) 

♦ n' n 



- ®n (X n' D n> 



n = 1,2,...,N (number of stages) 

£ = 1,2,...,L (number of objective functions) 

X = fixed, 
n 

[Figure 2]. The relationship between decision variables and return 
functions . 



Each choice of decision values which is allowable yields a feasible 
solution in criteria space as in Figure 3a. 




[Figure 3a]. Feasible solutions. 
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The full set of allowable solutions obtained by mapping all allow- 
able values of the decision variable into the criteria space yields 
some volume in that space, again as in Figure 3a. 

In general, the majority of these feasible solutions will be domi- 
nated, that is, a feasible solution will exist which is at least as good 
in all criteria and better in at least one criterion. 

A "nondominated" policy can be defined in the following way. A 
point D^* e X^ is said to be a nondominated policy if and only if there 

is no other policy D ^ e X such that 

n n 

0 i (X ,D "S > Q 1 (X , D *) , for all i 
n n n = n n n 

and 

Q 1 (X ,D 1) 7 * Q i (X ,D *) , at least one i 
*n n n *n n n 

where X is fixed, 
n 

That is, a "nondominated" policy has the property that there is no other 
feasible policy which would improve at least one goal variable while the 
others stay unchanged. 

The surface of nondominated solutions in criteria space for the 
example shown is indicated in Figure 3b. 




[Figure 3b]. Surface of nondominated solutions. 
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A set of representative nondominated solutions for the example shown 
is indicated in Figure 3c . 




[Figure 3c]. Representative nondominated solutions. 



D. BASIC THEOREM 



From the general M C 0 problem. 



maximize F(x) 

subject to g(x) <_ 0 for i = 1,2, ...,m 

where x is an N-vector and F(x) a function of x, is an L-vector with 
scalar component (f (x) , f (x) , . . . , f (x) ) . 

X Z L 

To use the dynamic programming method each of the L objective 
functions must satisfy the definition [1] and [2]. 

1 . Definition fl 1 (Separability) 

Let the aggregate return be 



~ g ^ r N' r N-l' — ' r ]J 



"Separability” can be written 



9 Iv Vi r i ] 

9 1 ^ r N' 9 2^ r N-l' ' r iH 
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I 

I 




where and are real-valued functions. 



2 . Definition [2] (Monotonicity) 

From g 2 (r N-l'"" r i )] 

g^ is a monotonically nondecreasing function of g^ for every fixed r^. 
If each objective function satisfies the definition [1], [2], then it 

can be decomposed into N stages. At each stage we get L return 
functions . 



r (X ,D ) = 

n n n 



r (X ,D ) 
n n n 



r ( X , D ) 
n n n 



r (X ,D ) 
n n n 



and 



n = 1,2,..., N (stages) 

& = 1,2,...,L (number of return functions) 



Q (X ,D ) = r (X ,D ) o F . (t (X ,D )) 
*n n n n n n n-1 n n n 



giving 



B„<V D n> - 



«„ <VV 



Q (X ,D ) 
n n n 



Bn ( V D n> 



We assume that there is some overall utility function implicitly 



defined on the Q (X ,D ) . At stage MX is fixed and the feasible 
x n n n n 
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values of D depend on X . 

n ^ n 



U[Q (X ,D ) ] = U[Q 1 (X ,D ), Q L (X ,D )] 

LX n n n J L n n n x n n n J 



This function is unknown but it can be assumed to be real-valued, 

Z 

monotonic and nondecreasing in each argument Q (X ,D ) . 

n n n 

The problem is to find D * for fixed X such that 
* n n 



MAX 

D eX 
n n 



U[Q_(X ,D)] = U [Q (X , D *) ] . 
L n n n n n n J 



3 . Theorem [l] [Geoffrion] 

Let the set of feasible be compact. Then at least one policy 

at which uTo (X ,D ) 1 achieves its maximum over X (fixed) is "non- 
LX n n n J n 

dominated. ,, That is 



MAX U[Q (X ,D 1 
^ v L *n n r n J 
D ex 
n n 



U [WV’] 



where D^*eQ, and Q, = the set of nondominated points. 



PROOF 



By the compactness (bounded and closed) of the feasible set of 



D and the continuity of U there is at least one maximal solution to 



U [Q( X ,D ) 1, say D *eQ. Consider some point D ' eX such that 
LX n n n J n n n 



O 1 (x ,D ') > Q 1 (X ,D *) i = 1,2, ...,L 
n n n = x n n n 



L 

and which maximizes £ O (X ,D ) over X . The point D 1 is obviously 

. .. n n n n ^ n 

i=l 



nondominated since otherwise there would be some D cX with 

n n 



Q 1 (X ,D ) > Q 1 (X ,D 1 ) for at least one i 
n n n n n n 
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and 



k ■ k 

Q (X ,D ) >Q (X , D 1 ) for the remaining k ^ i. 

*n n' n = *n n n 

Then , of course, 

SO L (X , D ) > SO 1 (X ,D ' ) which contradicts the choice 
*n n n *n n n 



of D 1 
n 



Since D 1 is nondominated 
n 



Q (X ,D ) > Q (X ,D ') = >Q (X ,D ) = Q (X ,D ') 

*n n n = *n n n *n n n *n n' n 



V- D e X 
n n 



Because U is monotone increasing, then also, letting = D^* 



Q (X ,D *) > Q (X , D ') = >Q (X ,D *) = Q (X ,D ') 
^n n n = *n n n *n n' n *n n' n 



implies 



U[Q (X , D *) 1 > U[Q (X , D ')] = >U [Q (X , D *) ] 
L *n n' n J = L *n n n J L *n n n J 



- U (WV>1 



so, D 1 also solves 
n 



MAX U[Q (X ,D ) ] = U[Q (X ,D ')] 
L *n n n J L *n n n J 



D eX . 
n n 



Thus, it is shown that at least one policy at which U[Q (X ,D ) 1 

* 1 L *n n n J 

achieves its maximum over X is nondominated 

n 

Q.E.D. 

The theorem provides the justification for solving for nondominated 
solutions - 
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Next, let us briefly explain the theory of the dynamic pro- 
gramming for the MCO problem. In dynamic programming the N-variable 
problems must be broken into N one-variable problems. So our objective 
is to decompose the problem 

vv - « IVVJ 

v~- ' ' D 1 

“ here VV = <5<VVV' Vi'Vi’Vi 1 ' — V <x i' D i )) - * V — v 

that is, ( V is the set of all feasible values of 

i'VVV' £or aU d h-— D 1 

subject to 

X = t ( X , D ) , n=l, 2 , ,N 

n-1 n n n 

into N equivalent subproblems each containing only one state variable 
and one decision variable. Then the solutions from the subproblems are 
combined to obtain the solution to the original problem. To achieve 
this decomposition, a highly restrictive assumption must be made about 
the function g_^. That is, a sufficient condition for decomposition is 
that g^ must be a monotonically nondecreasing function and another con- 
dition for the decomposition is the separability. 

In the case of a single objective function, a sufficient con- 
dition for achieving the decomposition has been given by Mitten [33] . 

We will prove that a condition similar to Mitten* s is a sufficient 
condition for the case of multiple objective function in theorem [2], 
[ 3 ], [ 4 ]. 

To decompose the problem 

vv - “ iyvi 

v— - D i 
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" here VV “ { ^ (r N^^N ,D N^ ' Vl’Vl'W ' ’ r l <X l' D l )) ' ¥D N V 

we must achieve the crucial step of moving the optimization (here finding 

all nondominated solutions) with respect to D ,0. inside the Nth 

N-l 1 

stage return. Let the multiple objective function be separable as 
follows: 



g(r N (X N' D N ) ' r N-l* X N-l' D N-l ) ' ,r l (X l ,D 1 ) } 



g l (r N (X N' D N ) ' 9 2* r N-l tX N-l ,D N-l ) ,r 1 (X l' D l ) ) ) 



then 



VV ■ {9( VW' Vi'VrVi 1, r i (! r D ii)' ¥ V- t i 1 
- 'VVVV- 9 ! ( Vi lx ..rVi , '“'VVi m ’ 



* V™ d i } ' 



From the definition of F^CX^) it follows that 



F N-l (X N-l ) _ n f T N-l (X N-1 } ] 






where 



VA-i 1 - sjfWWW-— ■ ' r i (x i' D i»' v d n -i' — D i 



Then the decomposition is as follows: 



VV - a [ S N<VI 

d n 

where 

VV - VVW' VlVl"' * V 
= '%'VVV'V 'A’V v V- 
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It is clear that we can proceed further by treating and then 

F„ „(X., _) , — ,F (X_) in the same way as F (X ) . We decompose the 
N— 2 N-2 2 2 IN N 

original problem into N one-stage initial state optimization problems. 



l. = n [s 1 (x 1 ) ] 



where S^X^ = {r^X^D^ , V- 



n. F (X ) = n [S (X ) 1 
n n L n n 1 

D 

. n 



where S (X ) = {g(r (X ,D ) , F (t (X ,D ))), VD } 
n n n n n n-1 n n n n 



n. f m <v = n tvy] 



N 



Where VV ■ {9(r N<VV' P H-1 ( VW»' * V 



Stating the N problems more compactly, we have 



F (X ) = a [ S ( X ) ] , n = 1,2, ,N 

n n N n 

D 

n 



where S (X) = {g(r (X.D ) , F (t (X ,D ))), VD } 
n n n n n n-1 n n n n 



= {Q (X ,D ) , V D } 
n n n n 



and Q (X ,D ) = r (X ,D ) 
n n n n n n 



n = 1 



= r (X ,D ) o F (t (X ,D )) n = 2,3, ,N 

n n n n-1 n n n 



This equation represents the usual recursion equation of dynamic program- 
ming, but for our application the usual max or min is replaced by the 
operator Q. 
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We are now in a position to prove the decomposition in the case 
of multiple objective function. Given the separability and monotonicity 



of we must prove 



a tvy] - 12 IVV1 

V — ' D i 



where 



W - WVV' » 2 (t «-l , Vl'Vl , '- C l IX r D l ))l ' 

* V — D 1 } 

VV * VVW- Vl (t »'V D K"'' * V' 

For the convenience of proving the decomposition, we transform 
the multiple aggregate function, 

5| V Vl‘— V V— D l> to 5 (x n' d n- — ' D l> 

by using the stage transformation 



X = t (X ,D ) 
n-1 n n n 



The stage multiple return is 



r = r (X ,D ) . 
n n n n 



From the transformation, it follows that X^ depends only on the decisions 
made prior to stage n (D n+ ^, D N ) and X N , that is, 



X n = “ fc n+ 1 ( fc n+2 ( X n+2 ' °n+2 } ' D n + 1> 



t n+l (X n+2' D n+2' D n+l ) fc n+l (t n+3 (X n+3' D n+3 ) ' °n+2' 



D n + 1 ) 



fc n+l (X N' D N' ,D n+l ) 
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It then follows, by combining the above equation with the return 
function, that the return from stage n depends only on the decisions 

{D n' D n+l' 'V V ^ at is ' 

7 n "VVV " 7 n < WVV'-'W ' D „> 

= 7 n (X N' D N* D n> ‘ 

The multiple aggregate return function R from stages one through N is 
some function of the individual stage returns written as 

7( V x n-i' V V— ' D 1> = 7(7 n< w • 

r N-l (X N-l' D N-l ) ' ' r L < x i ' D 1 > > • 

However, as just explained, (X^^, ,X^) can be eliminated from the 

individual stage returns and consequently from the total return. Thus 
an alternate expression for R is 

1( W~' ' D l> =i'VW' 7 N-l <X N' D N' D ( .-l ) '— ■ 

7 1 (X N' D N'— ' - D i» 

Let the multiple aggregate return function be separable as 

follows 



R( W ' D l } ^1 ^*"N r °N^ ' 9 2 (r N-l (X N ,D N' D N-l ) 

r i (X N' D N’ — 

and g^ be monotonic nondecreasing function of g^ V- fixed r^ 
where 
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Let 

VV - “ n IVVl 

V~ D 1 



where 



VV ^ g i (r N (X N' D N J ' 9 2 (r N-l <X N' D N' D N-l ) ' ' 

F i ( VV d n-1'“' d l ,)) ' * V" ' d 1 ! 

T (X ) is the set of all feasible values of 
N N 

9 l <r N <X N' D N ) ' 9 2 <r N-l <X N-l' D N' D N-l > ' ' 

vw— ' D l»> 



for all D^.D^, — , Dl 

and Q [T (X^) ] is the nondominated solutions among the values in 

V— - D 1 

the set of • 

® [vv ] 

d n 

where 



S N (X N ) g l (r N (X N ,D N^ 



Vr ,D i 



* d n } 
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“ d T N-1 <X N-1 ) " * 9 2 (r S-l (X H' D a' D H-l ) ' r l IX N' D N' '“l 11 



* Vi'-'V 



we must prove 



vv - w 



For notational simplification let 

M =F'( V 

{z) ’VV' 

Note a z e is nondominated in comparison with all other feasible 

values , while awe i- s nondom ^ natec ^ i- n comparison with the values 

enumerated in the evaluation of Q [ S n^^N^* 

d n 

4. Theorem [ 2 ] 

{w} £ {z} 

PROOF 

Suppose g w such that w e {w} 

then from w e {w} = ft [s„, (X ) 1 

N N 

N 

for some D N * ,D N-1*' ,D X * 

” = ^'vi^'V'Vi* 1 - — - 



Then it is false that there exist D ,D , , ,D, such that 

N N-l 1 

g i (r N (X N ,D N ) ' 9 2 (r N-l (X N ,D N' D N-l ) ' ' r l (X N' °N-1 ' - w 
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for any , since w is nondominated. 



But if w £ {z}, this cannot be for either of two reasons: 

(1) either w is dominated in {z} , then there exist 6 ,D xt ., ,D_ 

N N-l 1 



such that 



g l (r N ( W' 9 2 (r N-l (X N' D N ,D N-l ) ' ,r i ( W" ,D 1 )}) - W ‘ 



This contradicts the assumption w e {w} . 

(2) or else w £ {z} can occur because w was not enumerated when 
{z} was computed, 
but this cannot be because 

{"> £ GiVVV- ®2 ( WVV I W'— 'VV 0 !,'— 'D,)” 



v V" -“l 1 



that is. 



< w >£ {T N ( V }| 



This can be proved by the following proof 






fl . iVi'Vi’l i IVi'Vi’l 



We 



add r N D^) to both sides, then 



WW' ° £ ( % <r N (X N'V 



D N-1' ,D 1 



VA-i” 1 ' 



By monotonicity of g^, this equation is true. 



And this can be expressed as follows using the definition of S (X ) , 

N N 

vv 
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W } £ { VV> 



and 



a (VVI ^ { VV } 

3 n 



thus 



a <W> £ ( VV> £ ^ t n ^ ^ 



N 



< w > £ <W >• 

So w was enumerated when {z} was computed. Thus if w e {w}, we also 
have w e {z}. 

5 . Theorem [3] 

{z} £ {w} 

PROOF 

Suppose there exist z such that z e {z} 

(1) Then from 

z e {z} = « [T(XJ ] 

V ,d i 



where 



vv 



9 l <r S (X N' D S ) ' 9 2 !r N-l (X N’ D N' D H-l ) 



r t 



WV"- ' D l ) »' ¥ Vl'-- D l> 



for some D^*, D^* 



Z g l (r N (X N' D N* ) ' g 2 (r N-l (X N' D N* ' ,r i (X N ,D N*' 



and for all D^D^, ,D r 
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The following equality is false: 



g l (r N tX N' D N ) * 9 2 (r N-l (X N' D N' D N-l ) ' ' 

7 1 ( W— • D 1 ))) i 2 



(2) Also if we fix X^D^*, then 



where 



g 2 (r N-l ( VV' D N-l*’' — ' 

d n-i’ ,d i 



T N-1 (X N-1 ) " (g 2 <r N-l lX N' D K ,D N-l l ' ~' r l (X N' D N' ~ '’“l 1 - 



* Vr— - D i K 



Since otherwise its domination (say by, 

g 2 ^ r N-l^^N ,D N*'^N-l^ ' r N-2 °N* ' '°N-2^ ' ' 



would imply domination of z by 



g l (r N ( VV>- 9 2 <r H-l ( W’ D N-l ) ' ' r l ( W' ' 6 i n) 



(by monotonicity of g^) implying z / {z}. 

And now suppose z / {w}, this means either 

(a) z was not enumerated when {w} was computed. But this 
cannot be by (2) , or 

(b) z was dominated in {w}, then there is some D^, ,D^ 

combination such that 
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g l (r N <X N'^N ) ' g 2 tr N-l (X N'^N'^N-l ) ' " ' 

Vvvvi'-' ■ D i m ± 2 ’ 

but this contradicts (1) . 

Thus if z s {z}, we also have z e {w}. 

6 . Theorem f 4 1 

{w} = {z} 

PROOF 

By Theorems [2] and [3]. 

From Theorems [2], [3] [4], we have shown that if is a monotonically 

nondecreasing function of g 2 for every r N then 

« tVV ] = “ t ? N ( V ] 

V ,d i d n 

where 

VV - {g l (7 H ( W' — ' 7 1 (X 1- D 1 ,) - 

* d n'— 'V 

VV = fg l ( VVV' “ IVl'Wl 1 ' * D N>- 

Vi' - " ’ D 1 

Thus, the stage by stage dynamic programming recursion will generate 
exactly the set of all nondominated solutions to the original multiple 
criterion optimization problem. 
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III. BASIC COMPUTATION 



A. GENERAL SCHEME (DISCRETE CASE) 



Computational aspects of dynamic programming approach to MCO concern 
the solution of the multiple recursion equation. In the last chapter we 
defined the multiple recursion equation as follows. 



f (x ) = a [s (x ) ] (l) 

n n ^ n n 
n 



where 



S (X ) = {Q (X ,D ) 
n n n n n 



¥ D } 
n 



Q (X ,D ) = r (X ,D ) 
*n n' n n n n 



n = 1 



= r (XD) o F (t (X ,D )) 
n n n n-1 n n n 



(2) 

(3) 



n = 2 , . . . ,N 



with X = t (X ,D ) . 

n-1 n n n 

First, we start with n = 1 (Stage 1) calculating 



r i (X l' D l ) = 



r l (X l ,D l ) 



r i (X l' D l ) 








since n = 1. From Equation (2) , 

^1 (X 1' D 1 ) = ^.(Xi'D^ • 
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Equation (1) is then used to obtain F^X.^) D^(X^) by using the non- 

dominated method. These are saved for future calculations. But and 
are no longer needed. Since n ^ N, we increase n by 1 and calculate 



r 2 (X 2 ,D 2 ) 



for n = 2. is calculated from Equation (3) by appropriately combining 

r^ and F^. has now served its purpose and may be discarded. 

The calculations continue similarly for n = 3,...,N. Thus we obtain 
F^fX^), the nondominated solutions from the N-stage system, and D^CX^), 

(n = 1,...,N), the optimal nth-stage decision function of the nth-stage 
inputs . 

The next step is to determine the optimal inputs X* (n = 1,...,N-1) 
and the optimal decision D* (n = 1,...,N) . We have reached the point in 
the flow chart (see Fig. 1) where, for the first time, the answer to the 
question "Does n = N?" is "yes." 

* * 

We assume there is a prescribed value of X^ denoted by X^, is 
obtained immediately from the stored function D^fX^) . Then X^ ^ is 
calculated from = t^fX^*,^*) • Since n ^ 1, we compute from 

X* N _^ and the stored function D n- 1^^N-1^ * When the answer to the 
question "Does n = 1?" is "yes," the optimal solution is obtained. 
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© 




[Figure 1] . Flowchart of General Computation 
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B . TABULAR COMPUTATION 



The term tabular computations is used to describe dynamic programming 

analyses in which the optimal decision functions and optimal return 

functions are given in discrete form in lists or tables. In other words , 

for each feasible value of X , n=l,...,N there is an entry in a table 

containing values of D * (X ) and F (X ) . (See Table 1.) 

^ n n n n 

Naturally , it is only possible to give this information in a table 

when there are a finite number of feasible values for X . But tabular 

n 

representation of continuous data is possible if we make discrete 
approximations. For example, suppose the feasible values of X are the 



integer numbers from 1 to K . For X = K (=1,...,K ). D * (X =K) is the 

n n n n n 

set of all D which are nondominated points of 



and F n (X n =K) is the nondominated solution value of the above expression. 



from 1 to K . Suppose that for each feasible value of X. (0,1,2, ... ,K ), 
n In 



For each value of X^,r is calculated when X^K. The multiple 

return functions are 



n 



n 




At the first stage, the feasible values of X^ are the integer numbers 



the feasible decision variable value is = 0,1 



, • • • , 



(See Table 1.) 
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and is a nondominated point if there does not exist another such 
that 

r*(K,D^) > r^K^) ^ 

r l^ K,D l^ ^ r^(K,D^) At least one i 
D x = 0,1,2, ... ,K D x f D[ 

= 0,1,2, . . .,K 

that is 

f.(x ) = n [? (x ) ] 

D i 

where 

? 1 (X 1 ) = {Q 1 (X 1 ,D 1 ) V Dj} 

^1 (X 1 D 1 ) =r 1 (X 1 D 1 ). 

The tables of F^(X^) and D*(X^) are save< ^ f° r f uture calculations. 

(See Table 2.) 
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(SUMMARY TABLE) 





T3 
















03 














T3 


V 












c 


CD 


03 




o 




X 




X 


-P 


G C 














03 


•H 0 




II 




II 




II 


•H 


£ -H 


G 












O 


0 -P 


X 


c • 


• 


• G • 


• 


• G 


0 


TD 3 




X 




X 




X 


C/3 


G H 


c 






— * 




— " 


C/3 


0 OlP 


c 




c 




G 


<c 


g; cn 




IIP 




IP 




1 P 




CD 
















• H 
















X 
















03 
















•H 
















TJ P 
















CD 03 
















4J > 








,-v 




G 


<D 


03 




O 




X 




X 


-P 


G G 


c 












03 


•H 0 


X 


II 




II 




II 




£ -H 


— 1 












U 


0 03 4c C 


c • 


• 


• G • 


• 


• G 


0 


Tl -H 


Q 


X 




X 




X 


C/3 


G O 




— ' 








— ' 


C/3 


0 03 




4c G 




4c G 




- 4c C 


< 


£ Q 




Q 




Q 




Q 


C/3 
















03 


C 














3 


X 














«H 
















03 
















> 


CD 
















i-H 














03 


X 














iH 


03 














XI 


•H 












G 


•H 


P 




O 




X 




X 


C/3 


03 














03 


> 




II 




II 




II 


<D 
















P 


03 




£ • 


• 


• G • 


» 


• G 




4J 




X 




X 




X 


4-1 


03 














0 


-P 
















cn 














-p 
















U 1 


P 














•H 


0 














PI 


4-4 















44 



Finally, suppose that at the last stage (N) , the feasible value of 

the state variable is just one (K^) and the feasible values of decision 

variable are 0 through K^, and every entry in the table is the same as 

in Table 1. The calculations are basically done in two parts; first, 

the calculation of F (X ) and D*(X..), D*(X ), , D*(X ) and the second, 

NN 11 22 NN 

the tracing of the nondominated policy D^, ,D^. 

At the last stage we get X * = K and D * then by transformation 

X , = t (X* *,D *) we can compute X* „ and by the summary table of 
N-l N N N N-l 

Stage N-l (we saved each F N-1 (X^ ), X=l, /K N f D n- 1 ^^-1^ * We can 

compute D*_^ (X N . We will repeat this until n=l; then at least we will 

9 et d n' d n-i'""' d i* 

C. AN EXAMPLE: ONE STATE VARIABLE AND IRREGULAR RETURNS AND TWO 

OBJECTIVE FUNCTION 

This example shows how tabular computations are organized and 
provides some clues to the number of computations involved when the 
recursion equations are solved by enumeration. 

Example Problem. 

Five OA students meet to work the homework problems in two courses, 

D. P . and Stochastic Models; both have three problems. They agree to 
split into three teams... each team assigned one D.P. problem and one 
stochastic problem. The probability of success on a problem depends on 
the number of students working on it. (No student can work on more than 
one team. Assume all students' capabilities are equal.) 
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D.P. 



Number 
of Students 


Probability 


Problem 1 


Problem 2 


Problem 3 


0 


.0 


.0 


.0 


1 


.1 


.6 


.3 


2 


.2 


.8 


.5 


3 


.3 


.9 


.6 


4 


.4 


.9 


.7 


5 


.5 


.9 


.7 



Stochastic Project 



Number 
of students 


Probability 


Problem 1 


Problem 2 


Problem 3 


0 


.0 


.0 


.0 


1 


. 7 


.2 


.1 


2 


.8 


.3 


.4 


3 


.9 


.4 


.5 


4 


.9 


.5 


.6 


5 


.9 


.6 


.7 



How many students should be on each team to maximize the joint proba- 
bility that all problems in D.P. are solved and the joint probability 
that all in Stochastic are solved? (At least one student must be on 
each team.) The formulation of problem. 



Maximize 



3 

7T 

n=l 



P 1 (D ) 
n n 



3 

ir 

n=l 



P 2 
n 



(D n> 
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where 



3 

Subject to Z D _< 5 
n=l 

D > 0 , D = integer 
n — n 

P 1 = D P probability 
2 

P = stochastic probability 
n = 1 , 2 , 3 (number of problems) 



Multiple recursion equations 

F^X^ = IT [S^X^], Where ? (X^ = {p (X^D^ , ¥• 

F 2 (X 2 ) = n [S (X )] f where S 2 (X 2 ) = {Q 2 (X 2 ,D 2 ), V 

°2 — — — 

W = p 2 (x 2 ,d 2 ) o f 1 (x 2 -d 2 ) 

(Q 3 (X 3 ,D 3 ), ¥ D 3 ) 

F 3 (x 2 ,d 2 ) o F 2 (x 3 -d 3 ) 

Computations 



Stage 1 



X 1 


D i 


p i (D i> 


p i (D i> 


W 


1 


1 


.1 


.7 


.1 .7 


2 


1 


.1 


.7 






2 


.2 


.8 


.2 .8 


3 


1 


.1 


.7 






2 


.2 


.8 






3 


.3 


.9 


.3 .9 



w 



= ft [s,<X,> ], where S,<x ) - 



Q 3 (x 3) 



- r 1 2 n T 
P = [P , P ] 
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Summary Table 1 




Stage 2 



X 2 


°2 


t 2 ( X 2 , D 2 ) 


P 2 ( D 2 > 


P 2 , D 2 > 


5 l (t 2 


( x 2 , d 2 )) 


5 2 ( X 2 


,° 2 ) 


P 2 ( X 2 ) 


2 


1 


1 


.6 


.2 


.1 


.7 


.06 


.14 


.06 


.14 


3 


1 


2 


.6 


.2 


.2 


.8 


.12 


KD 

i — 1 
• 


.12 


i — 1 

• 




2 


1 


.8 


.3 


.1 


.7 


.08 


.21 


00 

o 

• 


.21 


4 


1 


3 


.6 


.2 


.3 


.9 


.18 


00 

1 — 1 
• 


.18 


.18 




2 


2 


.8 


.3 


.2 


.8 


.16 


.24 


.16 


.24 




3 


1 


.9 


.4 


.1 


.7 


.09 


.28 


.09 


.28 



Summary Table 2 



X 2 


^2 


< X 2 > 


° 2 * ( X 2 ) 


2 


.06 


. 14 


1 


3 


. 12 


. 16 


1 




.08 


.21 


2 


4 


.18 


. 18 


1 




.16 


.24 


2 




• 

o 


. 28 


3 



Stage 3 
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By back tracking we find the nondominated solution to contain two parts 



imply : 



(1) D 1 = 2 




D.P . = 0.06 
Stochastic = 0.064 

D.P. = 0.04 
Stochastic = 0.084 
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0 

0 

0 

0 

0 . 

0 , 

0 

0 



( 1 , 2 , 2 ) 




( 2 , 1 , 2 ) 



* j i 1 1 1 D. P. 

0.01 0.02 0.03 0.04 0.05 0.06 

Figure 2. Nondominated solutions in two-dimensional 
criteria space. 



D. THE CONTINUOUS VARIABLE CASE 



Let us consider the problems involved in solving MCO problem when 
we no longer require that the D ^ be integer. We shall then also drop 
the restriction that a_. , be integers. That is 



vv ■ a tvy]' 

d i-— - D » 



where 

VV = ,r 1 (X 1 ,D 1 )) , 

w d n' — 'V 

In carrying out the optimization we consider only non-negative D_. which 
satisfy 

n 

E a.D. < K . 
j=i ^ - n 



Precisely as be fore , 



f x (x ) = n [s 1 (x 1 )] 



©<0, <K /a n 
— 1— n 1 



where 



S 1 (X 1 ) = {r 1 (X 1 ,D 1 ), V D.^. 

Note that D_ varies up to K /a,, not fK /a n 1 . The same recurrence 
1 ^ n 1 L n 1 J 

relations follow that applied to the case where the D ^ were to be integer. 
That is 



F (X ) = ft [S (X ) ] 

n n __ , L n n J 

o<D <K /a 
— n— n n 
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where 



S (X ) = { Q ( X , D ) , V- D }, 
n n n n n n 



Q (X ,D ) = r (X ,D ) o F _ (X -a D ) 
*n n n n n' n n-1 n n n 



In order to apply the dynamic programming computation using the 
tabular form, the decision variables must be discrete. So first we 
must decide on a discrete grid to approximate the continuous variables. 
If the grid size is A, the length of interval (L) then the number of 
points (p) on which the function is defined is 




Thus for the fixed L, p is inversely proportional to A. If p is too 

large, then the D.P. is almost impossible to calculate, we must reduce 

the size by making the grid coarser. That is, by changing the grid from 

A to a larger value 5 there is a corresponding decrease in p. 

To see how the computations decrease as the grid increases, suppose 

we want the optimization of Q (X ,D ) for fixed X , and o<D <20,000 on 

n n n n — n— 

the grid d^ = 0.1, ,20,000. If we start initial grid, there are 

20,001 points at which the function is to be evaluated. 

By increasing the initial grid size to d = 0, 100, 200, ,20,000 

and evaluating the function at 201 points, the interval of optimality is 
reduced to a length of 200. Then another 201 evaluations of the function 
will yield the integer solution. But the number of computations can be 
reduced still further by starting with a very coarse grid d = 0, 1000, 

2000, , 20,000. Then the interval of optimality is reduced to a length 

of 20. Then another 21 evaluations of the function will yield the inte- 
ger solution. 
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Assume that the function F , (X . ) is known, then Q is defined 

n-1 n-1 n 

over a continuum of values D in the interval o<D <K /a . 

n — n — n n 



n 



If Q 5 
*n 



1 2 

is plotted (as Q (X ,D ) vs Q (X ,D ) 
* * n n n * n n n 



n 



(at fixed X ) 
n 



it might look like a space or a curve. (See Figure 1.) 




And the nondominated solution set is the thick line. We would like to 

determine the set of values D (X ) at which Q assumes its nondominated 

n n *n 

solution. To reduce as much as possible the computational effort 

required, one might do a rough initial search using a coarse grid of 

D , (D = 0,6,26, D ). (See Figure 2.) 

n n n 



I 






i 
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From this coarse grid of decision variables an approximate nondominated 
policy would be found from Figure 2 , points A and E . Then we can 
conclude that 



d 5* ■ 6 1 dA * n £ d6 * + 5 

that is, 0 <_ dA*n 5 

35 < dA*n < 55. 



A finer grid would be used in the neighborhood of the nondominated 
point to determine more nondominated values or to locate them more 
accurately. The procedure of using a finer grid might be repeated 
several times to obtain the desired number of nondominated points in 



D 

n 



<X n >. 



* '| M I 1 « » < M 

A B C D E F 



n + 



[Figure 3] 




E 



o 




n 
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Consider next the problems which arise in the tabulation of the 

F (X ) . A difficulty is encountered here because F (X ) is defined for 
n n n n 

all X , o<X <K . Furthermore, examination of the search procedure 
n — n— n 

introduced above shows that F^fX^) may actually be needed for any X^ in 

this interval, due to the fact that when finer and finer grids are used 

to determine F (X ) for a given S , F . (X _) may be required for a 
n n n n-1 n-1 

very large number of different arguments. It would usually be impossible 

to tabulate the F (X ) for all possible X that might be required with 
n n n 

the search procedure described above. Clearly, if this cannot be done, 

we must tabulate the F (X ) for a smaller number of X . 

n n n 

In dynamic programming the main benefit of a coarse grid on the 
decision variable is that it produces a corresponding coarse grid on 
state variables. 

In the relation 



X = t (X ,D ) 
n-1 n n n 



the number of feasible values of X^_^ is roughly proportional to the 

number of feasible values of D^. By reducing the number of feasible 

values of X w we evaluate F n (X - ) fewer times and the computational 
n-1 n— 1 n— 1 

procedure is sames as discrete case. The successive grids used in a 
dynamic programming analysis depend on the number of values tolerable for 
the state variables and the accuracy required. 



E. COMPUTATIONAL EFFICIENCY OF THE METHOD 

Let us now compare the computational efficiency of the dynamic pro- 
gramming approach to solving the MCO problem with that of simply 
enumerating all possible sets of non-negative integers D. which satisfy 
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the constraint 



N 

E a . D . <Kn . 



For illustrative purposes 



N 

let a. = 1 then Z a.D.<K become 
3 j=i 3 n 



N 

E D . <Kn 

3=1 



(1) 



and only one constraint and one variable. 
1. Direct Enumeration 



In the case of N stages and L objective functions and the 
constraint is (1) then the number of evaluations of objective functions 
is 



Kn 
L E 
K=0 



,n+K-l 
{ K 



) 



and the number of comparison for choosing nondominated solutions is 



Kn [( n+ J -1 )-l] ( n+ JJ 1 ) 
_ K K 

k'o 



12 L 

This shows the number of comparison of vectors (f , f , ,f ) . 

2. D.P . Approach 

At each stage, there are (Kn+1) x values and for each value of x 
there are (x+1) D values. The number of evaluation of objective functions 
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at one stage is 



L Z (x+1) = L( (Kn+1) + Kn( ^ n+1) . 

x=0 



All (N-l) stages have the same number of enumerations 
L x (N-l) [Kn+1) + — - ( - ^ - + - 1 -] 

and last stage has just (Kn+1) x L computations. Thus 

[ (N-l) ( (Kn+1) + K ft ( yP + . 1) ..) + (Kn+1)] x L 

and the number of comparisons for choosing the nondominated solutions is 
at each stage 

Kn 

Z x ( x+ 1 ) 
x=l 2 

and since all (N-l) stages are the same, the total number is 

Kn / n x 

(N-l) E ilSiL. 

X=1 



At the last stage, the number of comparisons is 
thus the total number in N-l stage is 



Kn (Kn+1) 
2 



,, r- x (x+ 1) , Kn (Kn+1) 

(fj-1) z -J — * — 5 . 

X=1 

Therefore total number of comparisons for all stages is 



(N-l) [jj(Kn+l) (2Kn+l) 



Kn (Kn+1) , Kn(Kn+l) 

+ ■ > + — 

4 2 
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For example (2 objective functions) 



DIRECT ENUMERATION 





10 


50 


100 


1000 


3 


572 


4.7xl0 4 


3. 5x10 5 


8 

3.3x10 


5 


6x10^ 


6.9xl0 6 


1.9xl0 8 


13 

1.7x10 


10 


3 . 7x10^ 


1.5X10 11 


13 

9.2x10 


23 

5.8x10 



DYNAMIC PROGRAMMING 



N\ 


10 


50 


100 


1000 


3 


2 86 


5. 4x10 3 


4 

2.1x10 


2x10^ 


5 


550 


lxlO 4 


4.1xl0 4 


6 

4x10 


10 


1210 


2. 4x10 4 


9.2xl0 4 


6 

9x10 



Notice that the direct enumerations grow exponentially and the dynamic 
programming approach increases linearly with N. 
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IV. CONCLUSIONS 



We have discussed the concept of the nondominated solutions and have 
shown how the concept of the dynamic programming can be applied to 
determine the nondominated point set. 

We have also discussed the sufficient conditions for the decomposi- 
tion of multiple criteria functions. Problems involving both the 
discrete and continuous case can be handled. 
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